Performance Simulation of Non-blocking Communication in Message-Passing Applications
Abstract
In our previous work [1], we introduced performance simulation as an instrument to verify hypotheses on causality between locally and spatially distant performance phenomena without altering the application itself. This is accomplished by modifying MPI event traces and using them to simulate hypothetical message-passing behavior. Here, we present enhancements to our approach, previously restricted to blocking communication, that now allow us to correctly simulate MPI non-blocking communication. We enhanced the underlying trace data format to record communication requests and extended the simulator so that it retains even the inherently non-deterministic behavior of operations such as MPI_Waitany.
Related resources
Scheduling Multiple Multicast for Heterogeneous Network of Workstations with Non-Blocking Message-Passing
This paper proposes efficient algorithms for implementing multicast in heterogeneous workstation/PC clusters. Multicast is an important operation in many scientific and industrial applications. Its efficient implementation on distributed-memory machines plays a critical role in the performance of distributed-memory parallel computing applications. Our work distinguishes itself from others in ...
Efficient Simulation of Message-Passing in Distributed-Memory Architectures
In this thesis we propose a distributed-memory parallel-computer simulation system called PUPPET (Performance Under a Pseudo-Parallel EnvironmenT). It allows the evaluation of parallel programs run in a pseudo-parallel system, where a single processor is used to multitask the program's processes, as if they were run on the simulated system. This allows development of applications and teaching o...
Architecture Specific Communication Optimizations for Structured Adaptive Mesh-Refinement Applications
Thesis by Taher Saif (thesis director: Professor Manish Parashar). Dynamic Structured Adaptive Mesh Refinement (SAMR) techniques for solving partial differential equations provide a means for concentrating computational effort to appropriate regions in the computational domain. Parallel im...
SCALASCA Parallel Performance Analyses of SPEC MPI2007 Applications
The SPEC MPI2007 1.0 benchmark suite provides a rich variety of message-passing HPC application kernels to compare the performance of parallel/distributed computer systems. Its 13 applications use a representative cross-section of programming languages (C/C++/Fortran, often combined) and MPI programming patterns (e.g., blocking vs. non-blocking vs. persistent point-to-point communication, with...
A Message-Passing Distributed Memory Parallel Algorithm for a Dual-Code Thin Layer, Parabolized Navier-Stokes Solver
In this study, the results of parallelizing a 3-D dual code (Thin Layer, Parabolized Navier-Stokes solver) for solving supersonic turbulent flow around body and wing-body combinations are presented. As a serial code, the TLNS solver is very time consuming and takes a large part of memory due to the iterative and lengthy computations. Also for complicated geometries, an exceeding number of grid...
Publication date: 2009